A Novel Web Text Mining Method Using the Discrete Cosine Transform

Authors

  • Laurence Anthony F. Park
  • Marimuthu Palaniswami
  • Kotagiri Ramamohanarao
Abstract

Fourier Domain Scoring (FDS) has been shown to give a 60% improvement in precision over the existing vector space methods, but its index requires a large storage space. We propose a new Web text mining method using the discrete cosine transform (DCT) to extract useful information from text documents and to provide improved document ranking, without having to store excessive data. While the new method preserves the performance of the FDS method, it gives a 40% improvement in precision over the established text mining methods when using only 20% of the storage space required by FDS.
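
The abstract does not describe the pipeline in detail, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a term's positions in a document are binned into a short signal, that signal is transformed with a DCT-II, and only the first few coefficients are stored. The function names (`term_signal`, `dct2`, `truncate`), the bin count, and the number of coefficients kept are all hypothetical choices made for illustration.

```python
import math


def term_signal(tokens, term, num_bins=8):
    """Count occurrences of `term` in each of `num_bins` equal-width position bins.

    Hypothetical helper: the binning scheme is an assumption, not taken from the paper.
    """
    signal = [0.0] * num_bins
    n = len(tokens)
    for i, tok in enumerate(tokens):
        if tok == term:
            # Map token position i (0 <= i < n) to a bin index in [0, num_bins).
            signal[i * num_bins // n] += 1.0
    return signal


def dct2(signal):
    """Unnormalised DCT-II: X[k] = sum_n x[n] * cos(pi / N * (n + 0.5) * k)."""
    size = len(signal)
    return [
        sum(x * math.cos(math.pi / size * (n + 0.5) * k) for n, x in enumerate(signal))
        for k in range(size)
    ]


def truncate(coeffs, keep):
    """Keep only the first `keep` (lowest-frequency) coefficients."""
    return coeffs[:keep]


# Example: build a signal for one term and store a shortened spectrum.
tokens = "web text mining with the cosine transform ranks web documents".split()
signal = term_signal(tokens, "web", num_bins=4)
stored = truncate(dct2(signal), keep=2)  # keep 2 of 4 coefficients (illustrative)
```

Storing fewer coefficients per term is one way an index of this kind could shrink; whether this matches the scheme behind the 20% storage figure reported in the abstract is an assumption.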

Similar resources

A Novelty Approach on Tamil Spam Text Extraction by Using Texton Template Based Support Vector Machine and Lp Boosting Classifier

In this proposed method, the Tamil language texts are analyzed through the Morris-Pratt Algorithm as input image that filtered with Gabor filter for edge analysis. Then, it converted into unique strings from the text blocks. The text strings consist of text stroke to analyze the pattern. By using wavelet transform, the features of pattern are extracted and it undergoes for mapping with the text...

A Novel Approach for Generation of All-optical Ofdm Using Discrete Cosine Transform Based on Optical Couplers in a Radio-over-fiber Link

A novel method for 100 Gbps all-optical OFDM using Discrete Cosine Transform in a Radio-Over-Fiber link is proposed. The system is designed simply using both symmetric and asymmetric passive optical couplers. DCT is achieved all-optically by adjusting the length and splitting ratio of the couplers. The performance of the system is compared with all-optical OFDM based on Discrete Fourier, Discrete ...

3D Model Retrieval Based on 3D Discrete Cosine Transform

The content-based retrieval systems for 3D models on the Web become necessary since digital databases of 3D objects are growing. In this paper, we propose a new method to describe 3D models. This method is based on 3D discrete cosine transform which is applied for the voxelized 3D model. The discrete cosine transform is widely used for 2D image compression and it shows its performance for the J...

Web Based Novel Technique for Watermarking Colour Images on Android Mobile Phones

In this paper, a real time, ubiquitous and novel technique for automatically watermarking the color images for android mobile phone is discussed and demonstrated. The captured images are watermarked before it can be saved on memory or sd-card of mobile phone. No processing power of mobile phone is required for embedding or extraction processes. Hence, the proposed method consumes very less batt...

Multi-Focus Image Fusion in DCT Domain using Variance and Energy of Laplacian and Correlation Coefficient for Visual Sensor Networks

The purpose of multi-focus image fusion is gathering the essential information and the focused parts from the input multi-focus images into a single image. These multi-focus images are captured with different depths of focus of cameras. A lot of multi-focus image fusion techniques have been introduced using considering the focus measurement in the spatial domain. However, the multi-focus image ...

Publication date: 2002